Non-Euclidean Dissimilarities: Causes, Embedding and Informativeness
نویسندگان
چکیده
In many pattern recognition applications object structure is essential for the discrimination purpose. In such cases researchers often use recognition schemes based on template matching which lead to the design of non-Euclidean dissimilarity measures. A vector space derived from the embedding of the dissimilarities is desirable in order to use general classifiers. An isometric embedding of the symmetric non-Euclidean dissimilarities results in a pseudo-Euclidean space. More and better tools are available for the Euclidean spaces but they are not fully consistent with the given dissimilarities. In this chapter first a review is given of the various embedding procedures for the pairwise dissimilarity data. Next the causes are analyzed for the existence of nonEuclidean dissimilarity measures. Various ways are discussed in which the measures are converted into Euclidean ones. The purpose is to investigate whether the original non-Euclidean measures are informative or not. A positive conclusion is derived as examples can be constructed and found in real data for which the non-Euclidean characteristics of the data are essential for building good classifiers.1 Robert P.W. Duin Faculty of Electrical Engineering, Mathematics and Computer Sciences, Delft University of Technology, The Netherlands, e-mail: [email protected] Elżbieta P ↪ ekalska School of Computer Science, University of Manchester, United Kingdom, e-mail: [email protected] Marco Loog Faculty of Electrical Engineering, Mathematics and Computer Sciences, Delft University of Technology, The Netherlands, e-mail: [email protected] 1 This chapter is based on previous publications by the authors, [16, 17, 19, 21, 23, 43] and contains text, figures, equations and experimental results taken from these papers.
منابع مشابه
Non-Euclidean Dissimilarities: Causes and Informativeness
In the process of designing pattern recognition systems one may choose a representation based on pairwise dissimilarities between objects. This is especially appealing when a set of discriminative features is difficult to find. Various classification systems have been studied for such a dissimilarity representation: the direct use of the nearest neighbor rule, the postulation of a dissimilarity...
متن کاملRicci flow embedding for rectifying non-Euclidean dissimilarity data
Pairwise dissimilarity representations are frequently used as an alternative to feature vectors in pattern recognition. One of the problems encountered in the analysis of such data, is that the dissimilarities are rarely Euclidean, while statistical learning algorithms often rely on Euclidean dissimilarities. Such non-Euclidean dissimilarities are often corrected or a consistent Euclidean geome...
متن کاملOn Not Making Dissimilarities Euclidean
Non-metric dissimilarity measures may arise in practice e.g. when objects represented by sensory measurements or by structural descriptions are compared. It is an open issue whether such non-metric measures should be corrected in some way to be metric or even Euclidean. The reason for such corrections is the fact that pairwise metric distances are interpreted in metric spaces, while Euclidean d...
متن کاملNon-Euclidean dissimilarity data in pattern recognition
This thesis addresses problems in dissimilarity (proximity) learning, particularly focusing on identifying the sources and rectifying the non-Euclidean dissimilarity in pattern recognition. We aim to develop a framework for analyzing the non-Euclidean dissimilarity by combining the methods from differential geometry and manifold learning theory. The algorithms are applied to objects represented...
متن کاملGeneralized Non-metric Multidimensional Scaling
We consider the non-metric multidimensional scaling problem: given a set of dissimilarities ∆, find an embedding whose inter-point Euclidean distances have the same ordering as ∆. In this paper, we look at a generalization of this problem in which only a set of order relations of the form dij < dkl are provided. Unlike the original problem, these order relations can be contradictory and need no...
متن کامل